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This review addresses difficulties arising in estimating 
epidemiological parameters of leukodystrophies and 
lysosomal storage disorders, with special focus on Krabbe 
disease. Although multiple epidemiological studies of Krabbe 
disease have been published, these studies are difficult to 
reconcile since they have used different study populations and 
varying methods of calculation. Confusion exists regarding 
which epidemiological parameters have been estimated; the 
current review shows that most previous estimates can be 
properly interpreted as lifetime risk at birth. One of the most 
common estimation methods is shown to be inaccurate, while 
two other methods are shown to be approximately accurate. 
Based on the results of the current paper, recommendations 
are made that are expected to improve the quality of 
future studies of Krabbe disease. It is anticipated that these 
recommendations will be applicable to epidemiological 
studies of other lysosomal storage disorders, as well as any 
other rare diseases diagnosed with enzymatic screening. 



Introduction 

Krabbe disease (OMIM # 245200) arises as a result of defi- 
cient galactosylceramidase (GALC; E.C. 3.2.1.46) activity, 
which is caused by a mutated GALC gene. GALC is a degra- 
dative lysosomal enzyme, and thus Krabbe disease is classified 
as a lysosomal storage disorder 1,2 in addition to its classifica- 
tion as a leukodystrophy. Since the GALC gene was cloned, 3 
about 100 mutations to the G^4ZCgene have been identified. 4 

Correspondence to: Randy L. Carter; Email: rcarter@buffalo.edu 
Submitted: 03/03/13; Revised: 05/14/13; Accepted: 05/29/13; 
Published Online: 06/18/13 

Citation: Foss AH, Duffner PK, Carter RL Lifetime risk estimators in 
epidemiological studies of Krabbe Disease: Review and Monte Carlo 
comparison. Rare Diseases 2013; 1:e25212; http://dx.doi.org/10.4161/ 
rdis.25212 



Some mutations are associated with well-specified pheno- 
types. For example, individuals homozygous for the 30-kb 
deletion (OMIM # 606890.0002) generally develop the early 
infantile form of the disease. Other mutations, however, lead 
to substantial phenotypic variability, both intrafamiliar and 
otherwise. 5 

Krabbe disease is relatively difficult to diagnose due to its 
rarity and phenotypic heterogeneity. Primary care physicians 
may never see a case in their career. The proportion of Krabbe 
patients that never receive a correct diagnosis is unknown. 
Diagnostic accuracy is likely to vary drastically based on a fam- 
ily's access to adequate medical care and physician experience. 
In an international survey of 334 families self-reported to have a 
member suffering from Krabbe disease, 103 reported the length 
of time between initial symptom onset and correct diagnosis. 
Mean time to diagnosis was 5.3 mo, 6 suggesting a substantial 
difficulty in the diagnosis of Krabbe disease. It has been sug- 
gested that individuals with adolescent and adult forms are par- 
ticularly susceptible to misdiagnosis due to reduced awareness 
of later-onset subtypes. 6,7 While symptom onset during infancy 
is generally associated with a more predictable and fulminant 
clinical course, the course of patients with later onset is quite 
variable, 6,8,9 and this variability is likely to contribute to diagnos- 
tic challenges facing practitioners. 

Enzymatic screening and mutation analysis have improved 
antemortem diagnostic capabilities, offering good sensitivity in 
individuals with putative symptoms of Krabbe disease. As might 
be expected, challenges arise when these methods are applied to 
a large population of asymptomatic individuals. In 2006, New 
York State began screening all newborns for Krabbe disease, and 
by 2008 roughly 550,000 newborns had been screened. Four 
newborns were placed in the high-risk category based on GALC 
activity below 0.15 nmol/h/mg; two of these newborns received 
a positive diagnosis based on subsequent mutation analysis and 
abnormal neurodiagnostic studies. The other two had negative 
neurodiagnostic studies and remained asymptomatic at follow- 
up at 8 and 16 mo of age. Six were placed in the moderate-risk 
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category (GALC between 0.16-0.29 nmol/h/mg protein) and 15 
were placed in the low-risk category (0.3-0.5 nmol/h/mg pro- 
tein). All moderate- and low-risk children were asymptomatic 
at follow-up. 5 Presently, only subsequent follow-up will be able 
to determine whether these asymptomatic at-risk children will 
develop symptoms at a later date, whether they are carriers of the 
disease or whether they simply harbor non-pathogenic GALC- 
lowering polymorphisms.' 

Epidemiological analysis must also consider missed diagno- 
ses due to censored cases: individuals who would have developed 
symptoms of Krabbe disease but died of unrelated causes before 
symptom onset. The impact of censoring on measures of lifetime 
risk at birth will be quantified in subsequent sections of the cur- 
rent paper. 

Previous Epidemiological Studies of Krabbe Disease 

The literature on the epidemiology of Krabbe disease was sur- 
veyed, and any article that reported an estimated rate of disease 
(prevalence, incidence, birth prevalence, etc.) was considered for 
review. 33 " 36 These previous epidemiological studies of Krabbe 
disease are summarized in Table 1. There are numerous pub- 
lished studies that reported raw numbers of patients suffer- 
ing from Krabbe disease in a given population. 10 " 15 Since these 
studies did not estimate disease rates or proportions relative to 
a reference population, they will not be reviewed in the current 
paper. Seven studies aimed to estimate the proportion of indi- 
viduals who will eventually be diagnosed with the disease in a 
given birth cohort, 7 ' 16 " 21 which, as shown below, is best referred to 
as lifetime risk at birth. Three studies estimated lifetime risk for 
other disease categories, but also provided sufficient information 
to estimate it for Krabbe disease. 22 " 24 Three studies estimated this 
proportion for isolated sub-populations within Israel and will not 
be reviewed further. 25 " 27 Several book chapters provided epide- 
miological estimates without describing the methods used for 
estimation. 28 " 31 

The studies reporting the proportion of individuals affected 
by Krabbe disease all generally sought to approximate the same 
quantity: the proportion of newborns that will develop the dis- 
ease at any point in their life. It will be asserted below that this 
quantity corresponds best to lifetime risk at birth. Reported cal- 
culations typically involved dividing the number of diagnoses 
in a given time period by the number of births within a corre- 
sponding time period. A broad and often conflicting variety of 
terms were used to describe this quantity in the literature. Terms 
used include incidence (e.g., ref. 16), prevalence (e.g., ref. 18) 
and birth prevalence (e.g., ref. 20). Meikle et al. 18 used the term 
prevalence to denote calculations including pre- and postnatal 
diagnoses and incidence to denote calculations that include only 
postnatal cases. 

The usage of the terms above to describe the proportion in 
question does not match formal definitions in epidemiological 
publications. This misuse of terminology in the literature pres- 
ents two problems that will subsequently be addressed: first, 
identifying what parameter these studies intended to estimate 
and second, evaluating the accuracy of those estimations. 



What parameters are estimated? Given common incorrect 
use of terms used to describe these epidemiological quantities, 
the correct interpretations of reported rates in the seven stud- 
ies listed above 716 " 21 must be inferred from their stated meth- 
ods. The proportion of newborns that are, or will be, affected 
by a disease is best described as birth prevalence or lifetime risk 
at birth, depending on the nature of the disease being studied. 
For conditions that are exclusively incident at or near birth, 
such as cleft palate, they are equivalent, since the number of 
affected newborns is equal to the number of individuals who 
will develop the disease at any point in their lives. For conditions 
with a variable age of onset, however, birth prevalence refers only 
to those who have the disease at birth. Since the majority of 
studies included juvenile or adult onset Krabbe cases, 7,17 " 20 ' 22 " 24 
birth prevalence does not adequately describe the estimates in 
question. It is conceivable that the studies could be describing 
the birth prevalence of necessary and sufficient factors for devel- 
oping the disease later in life, but it is not fully known what 
these factors are (as illustrated above by the challenges of new- 
born screening and unknown predictive validity of many patho- 
genic alleles). We propose that lifetime risk at birth describes 
the desired proportion more effectively. Lifetime risk is a spe- 
cial case of cumulative incidence (which we distinguish from 
incidence rate) in which the period of time studied is the entire 
remaining lifetime. Lifetime risk can be estimated conditional 
on disease-free survival up to a specified age, and thus lifetime 
risk at birth should be specified as such. For discussions of all 
the above epidemiological terms, readers are referred to several 
resources listed in the bibliography. 32 " 37 

The seven studies listed above do not capture information 
regarding which age groups are at greater risk of death due to 
Krabbe disease. Only one study 38 aimed to measure age-specific 
mortality rates. Using national and state death certificates in the 
United States, the number of deaths of individuals with Krabbe 
disease was estimated for the years 1999-2004. Mortality rates 
were estimated separately for children less than five years of age 
and compared with mortality rates of individuals greater than or 
equal to five. It was found that mortality in children less than 
five years of age (0.994 deaths per year per million children) was 
greater than mortality in older individuals (0.007 deaths per year 
per million individuals). Barczykowski et al. 38 also showed that 
the yearly mortality rate in children less than 5 y of age due to 
Krabbe disease is a good approximation to the incidence rate of 
the early infantile form of Krabbe disease. 

It is not possible to evaluate the estimates provided in any 
available book chapters, 28 " 31 since the methods used were not 
described. These estimates are referred to as incidence, although 
estimates are not given in units involving time, so they cannot be 
incidence rates. It is presumed that these estimates correspond to 
lifetime risk at birth, as described above, but without a descrip- 
tion of the calculations used this cannot be determined. 

How accurate are the estimates? Although it can be assumed 
that the seven studies in question 7,16 " 21 produced ratios that can be 
interpreted as estimates of lifetime risk from birth, it is not imme- 
diately clear that they succeeded in measuring this quantity accu- 
rately. These studies did not follow cohorts longitudinally, but 
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Table 1 . Previous studies of Krabbe epidemiology 



Reference 


Methods 
described? 


Study 
population 


Reported 
measure 


Numerator 


Denominator 


Estimate, per 
1 million units 


Hagberg etal. 1969 ° 


Yes 


Sweden, 1953-1967 


incidence 


32 postnatal cases born 
1953-1967 


1.66 million births 
1953-1967 


19 


Heimetal. 1997" 


Yes; DOB 


Germany, 
1981-1989 


incidence 

O 


34 cases born 1981 - 1989 
and diagnosed before 1994 


All births 1981-1989 


6.0 


Meikleetal. 1999 s 


Yes; Dx 


Australia, 
1980-1996 


preva- 
lence 


30 pre- and postnatal diag- 
noses 1980-1996 


4,222,323 births 
1980-1996 


7.1 


Meikleetal. 1999" 


Yes; Dx 


Australia, 
1980-1996 


incidence 


21 postnatal diagnoses 
1980-1996 


4,222,323 births 
1980-1996 


5.0 


Poorthuis et al. 

1999 20 


Yes; DOB 


Netherlands, 
1971-1995 


birth prev- 
alence 


63 pre- and postnatal diag- 
noses 1971-1996 


4,677,849 births 
1971-1995 


13.5 


Ozkara and Topcu 

2004 21 


Yes 


Turkey, 1997-2002, 
age < 5 


incidence 


65 postnatal cases diagnosed 
1997-2002, born 1997-2002 


6,500,000 live births 
between 1997-2002 


10 


Pinto etal. 2004' 


Yes; DOB 


North Portugal, 
1984-1999 


birth prev- 
alence 


9 pre- and postnatal diagno- 
ses 1988-2001 


All births 1984-1999 


12.1 


Poupetova et al. 
2010 


Yes; DOB 


Czech Republic, 
1975-2008 


birth prev- 
alence 


13 cases born 1977-2002, 
diagnosed 1975-2008 


3,249,150 live births, 
1977-2002 


4.0 (2.1 - 6.8)§ 


Barczykowski et al. 

201 2 38 


Yes 


United States, 
1999-2004, age < 5 


mortality 
rate 


19.40 expected deaths per 
year due to Krabbe disease 


19,521,730 average pop- 
ulation size, age < 5 


0.994 


Barczykowski et al. 

201 2 3S 


Yes 


United States, 
1999-2004, ages 5 


mortality 
rate 


1.87 expected deaths per 
year due to Krabbe disease 


266,827,247 average 
population size, age > 5 


0.007 


Zlotogora et al. 

1985 25 


Yes 


Druze in Israel 
1969-1983 


incidence 


12 postnatal cases born 
1969-1983 


2000 births 1969-1983 


6000 


Zlotogora et al. 
1991 26 


Yes 


Jews in Israel, 
1973_1987 


incidence 


0 cases 1969-1983 


1,000,000 births 
1969-1983 


0 


Zlotogora et al. 
1991 26 


Yes 


Muslim Arabs in 
Jerusalem, 
1973_1987 


incidence 


6 postnatal cases born 
1979-1987 


3731 births 1973-1987 


1610 


Macarov etal. 2011 27 


Yes 


Muslim Arabs in 
Jerusalem 
1999-2002 


incidence 


6 postnatal cases born 
1999-2002 


3600 live births 
1999-2002 


1670 


Macarov etal. 201 1 2 ' 


Yes 


Muslim Arabs in 
Jerusalem, 

ZUU3-ZUU/ 


incidence 


4 postnatal cases born 
2003-2007 


4876 live births 
2003-2007 


820 


Suzuki and Suzuki 

1 983 28 


No 


Japan, 8 y period 


incidence 


17 cases over an 8 y period 


Births 


2-3 


Suzuki etal. 1995"' 


No 


Japan, 1972-1986 


incidence 


25 cases 


Births 


5-10 


Suzuki etal. 1995* 29 


No 


France 


incidence 


Cases of Krabbe disease 


Live births 


6.7 


Wenger etal. 2001 30 


No 


United States 


incidence 


Cases of Krabbe disease 


Births 


10 


Pastores and 
Kolodny 2006 31 


No 


"General popula- 
tion" 


incidence 


Cases of Krabbe disease 


No description 


5.0 


Applegarth et al. 

2000 22 


NA t 


British Columbia, 
1972-1996 


Lifetime 
risk at 
birth t 


2 postnatal, 1 prenatal cases 
born 1972-1996, diagnosed 
1972-1997 


1,035,816 live births 
1972-1996 


2.9 t 


Dionisi-Vici et al. 

2002 23 


NAt 


Italy, 1985-1997, 
< 18yold 


Lifetime 
risk at 
birth t 


36 postnatal cases diag- 
nosed 1985-1997 


7,173,959 births 
1985-1997 


5.0 t 


Bonkowsky et al. 


NA t 


Utah, 1999-2007, 


Lifetime 


2 postnatal diagnoses 


451,171 live births, Utah, 


4.4 t 



2010 24 <18yold riskat 1999-2007 1999-2007* 

birth t 

As discussed in the current paper, many of these estimates are inaccurate or uninterpretable. *Vanier MT, cited in Suzuki et al. 29 as personal com- 
munication; OHeim et al. 17 reported incidence but cited the method of Claussen et al., 45 whose method is described therein as cumulative incidence; 
tcalculated from reported data by current author A.F., using the Dx method; note that original authors may not have intended data to be used in this 
manner; §95% Poisson confidence interval; $ Houston et al. 46 
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Table 2. Definitions of key terms and the three estimation methods evaluated in the Monte Carlo (MC) simulations 



Term 


Definition 


Diagnosis Period 


The period of time in which dates of Krabbe diagnoses were collected. 


Observed Case 


An individual receiving a Krabbe diagnosis during the diagnosis period. 


Censored Case 


An individual who would have been an observed case, but died of unrelated causes before symptom onset. 


Birth Period 


The period of time between the earliest and latest birthdate of all observed cases. 


ux ivicinou 


Method of estimating lifetime risk at birth calculated by taking the number of observed cases divided by the number of total 
births during the diagnosis period. Although it is slightly more biased than the LT method, it is often the best possible method for 

studies of Krabbe disease. 


DOB Method 


Method of estimating lifetime risk at birth calculated by taking the number of observed cases divided by the number of total 
births during the birth period. A biased method that should not be used. 


Life-Table (LT) 
Method 


Method of estimating lifetime risk at birth that adjusts for censored cases. For a full description see, for example, Beiser et al. 32 The 
best method reviewed, but often unusable for studies of Krabbe disease due to the unavailability of appropriate data. 



rather obtained the dates of all Krabbe diagnoses in a given time 
period (hereafter referred to as diagnosis period). Two distinct 
methods were used to calculate the denominator of the ratios 
reported in these papers. As will be shown below, one method is 
flawed, while the other yields approximately accurate estimates 
of lifetime risk. 

Meikle et al. 18 estimated lifetime risk at birth using a method 
referred to in the current study as the "Dx" method (see Table 
2). In this study, it was explicitly assumed that the number of 
diagnoses in the diagnosis period was equal to the number of 
affected births within that same diagnosis period (affected births 
referring to any individual who would ever develop symptoms) 
and that the appropriate denominator must be the number of 
births occurring in the diagnosis period. Assuming that the 
effect of censoring is minimal, a constant birth rate, a constant 
diagnosis rate and that all affected individuals receive a diag- 
nosis, this equivalence must hold true, as shown in Figure SI. 
Therefore, given these assumptions, the Dx method appears to 
be an approximately accurate means of estimating the lifetime 
risk of the disease. 

Four studies 7,17,19 ' 20 estimated lifetime risk at birth using a 
method referred to in the current study as the "DOB" method 
(see Table 2). In these studies, the earliest and latest birth date 
of cases diagnosed in the diagnosis period was determined. The 
total number of births in the population within this birth range 
was used as the denominator. While this method ensures that 
all potentially relevant births are reflected in the denominator, it 
introduces a serious source of error. Any case of Krabbe disease 
that was born within the birth range and died before the begin- 
ning of the diagnosis period remains uncounted in the numera- 
tor, resulting in underestimated lifetime risk. Furthermore, the 
beginning and end of the birth period is determined by the 
two most extreme birthdates. Since the sample range of birth- 
dates increases with increasing sample size, the severity of this 
error increases with increased sample size. This source of error 
was recognized by some study authors, who removed cases that 
they believed would excessively inflate the denominator. 7,17,20 
None of these authors reported a quantitative method of select- 
ing cases for removal, and thus their method is not reproducible 
by a third party. It appears that most of the cases were removed 
due to an early birth year, which, had they not been removed, 



would have greatly increased the number of births included in 
the denominator. 

The accuracy of the Dx and DOB methods, and the influence 
of censored cases on the Dx method, is further analyzed in a 
Monte Carlo simulation study that is described in the next sec- 
tion of the current paper. 

Ozkara and Topcu 21 studied the occurrence of Krabbe disease 
in children 0-5 y of age during the five year period from 1997- 
2002 in Turkey. They included for analysis all Krabbe patients 
both born and diagnosed within 1997-2002 and divided this 
number by the number of total births observed within 1997- 
2002. This method is not accurate, as the birth cohorts in the 
study period are not followed for equal amounts of time. 

Hagberg et al. 16 studied cases of Krabbe disease that had been 
born from 1953-1967. They divided the total number of affected 
cases born during this time period by the total number of births 
during this time period. In this particular sample, however, the 
patients with the earliest and latest birthdates died in the same 
year they were born. In this special case, the method used by 
Hagberg et al. 16 is equivalent to the Dx method; thus, the conclu- 
sions reported herein regarding the Dx method are applicable to 
Hagberg's method. 

Case ascertainment. In addition to the various ratios that 
have been reported and the various calculations used to approxi- 
mate lifetime risk at birth, varying methods of case ascertain- 
ment have been used in the epidemiological studies reviewed 
above. These methodological differences are large enough to 
render direct comparisons across studies difficult, if not impos- 
sible. Most studies included patients of any age, while others 
included restricted age ranges. 23,24 Some studies included fetuses 
diagnosed prenatally in the analysis (e.g., ref. 19), whereas oth- 
ers do not appear to have done so (e.g., ref. 16). Studies report 
different coverage of the observed population; e.g., Heim et al. 17 
reported a response rate less than 80% in the queried medical 
institutions, whereas Hagberg et al. 16 reported a 100% response 
rate). Additionally, there is likely to be a non-negligible num- 
ber of undiagnosed patients with Krabbe disease in any popula- 
tion; the rate of undiagnosed cases is likely to vary with access to 
adequate primary health care providers, access to specialists in 
pediatric neurology and physician experience. Within any study, 
these factors are largely unknown, rendering corrections for these 
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sources of error impossible. Naturally, the measured proportions 
are also affected by the true disease proportion in the study popu- 
lation. Unfortunately, it is impossible to separate these last two 
sources of between-study variability, and it remains possible that 
country-level differences are merely attributable to country-spe- 
cific rates of missed diagnoses of Krabbe disease. 

Monte Carlo Evaluation of Methods 
for Lifetime Risk Estimation 

A Monte Carlo (MC) simulation study was implemented to fur- 
ther evaluate the accuracy of two methods commonly used to 
estimate lifetime risk at birth of Krabbe disease: the Dx method 
and the DOB method (see Table 2). A third method which we 
shall refer to as the life-table (LT) method 32 was also evaluated. 
The latter method has the advantage of correcting for error due to 
censored cases and yielded the most accurate estimates. Although 
the LT method is often not practical for studies of Krabbe disease 
due to the extensive data required, we have included it in the 
simulation to serve as a reference to which the other methods can 
be compared. 

A MC study is a computer simulation in which random sam- 
ples are repeatedly generated in order to study a phenomenon of 
interest. 3 ' MC simulations have been used to investigate difficult 
problems across a broad range of disciplines, e.g., evolutionary 
biology, 40 radiology, 41 and finance. 42 In the current study, sam- 
ples of populations with a known lifetime risk at birth of Krabbe 
disease were generated. Since the lifetime risk was known, the 
accuracy of the estimation methods could be calculated directly. 
Because the accuracy of the estimation methods were affected 
by fixed population parameters, these parameters were system- 
atically varied to simulate the range of populations studied in 
previous studies of Krabbe epidemiology. All simulations were 
implemented using the R programming language. 43 

Parameters and distributional assumptions of MC simula- 
tion. The four input parameters to the (MC) study were: T, the 
length of the diagnosis window in which cases could be regis- 
tered; N, the constant number of total births each year (both 
Krabbe and non-Krabbe cases); LR, the true lifetime risk at birth 
of the disease; and A, the average age at diagnosis of the disease. 
It was assumed that the birth, diagnosis and age-specific death 
rates did not change with calendar time and that there was no 
migration into or out of the observed population. It is impor- 
tant to note that a constant birth rate guarantees that the Dx 
method performs optimally; any violation of this assumption 
would result in biased estimates. All settings of the parameters 
are listed in Table 3. 500 MC samples were generated for each of 
4320 unique combinations of parameters. 

The birthdate and diagnosis date for each individual with 
Krabbe disease were simulated independently as the dates of 
randomly generated events in Poisson processes. Birth events 
were simulated by generating inter-arrival times between suc- 
cessive affected births independently drawn from an exponen- 
tial distribution with rate parameter equal to N times LR until 
the diagnosis window was exceeded (note that the number of 
affected births simulated was always much less than AO - Starting 



Table 3. Parameter settings used during the Monte Carlo (MC) simula- 
tion 



Parameter 


Settings 


Length of Diagnosis Period 


5 - 30 y in increments of 5 


Total Births Per Year 


50,000 - 1,000,000 births in increments 




of 50,000 


True Lifetime Risk at Birth 


0.25, 0.5, 0.75, 1, 2, and 3 per 100,000 


Expected Age at Diagnosis 


0.3,1,2, 3, 4,5 



A series of MC simulation studies were conducted in which the total 
number of births per year, along with the birth dates and diagnosis 
dates of individuals with Krabbe disease, were simulated. Each MC simu- 
lation study was conducted using a distinct combination of parameter 
settings; the range of values used for each parameter is listed here. The 
range for each parameter was chosen to reflect the range of values 
observed in published studies of Krabbe disease. 

with the birthdate, each date of diagnosis was determined by 
drawing a corresponding age from an independent exponen- 
tial distribution with rate parameter equal to 1 + A. In order 
to ensure that every single affected individual that could have 
survived into the diagnosis period was considered, birth and 
diagnosis dates were generated for the 110 y prior to the start 
of the diagnosis period. 

For each affected birth, the probability of censoring was deter- 
mined based on all-cause mortality rates in the US in 2009. 44 See 
Supplemental Methods for a detailed description of censoring 
calculations. 

Lifetime risk calculations and criteria for comparison. 

Lifetime risk was estimated using the Dx, DOB and LT methods. 
The numerator of both the Dx method and the DOB method 
used the number of uncensored diagnoses observed in the diag- 
nosis period. The Dx method divided this number by the total 
number of births (affected and unaffected) occurring within the 
diagnosis period (i.e., NxT), The DOB method, on the other 
hand, first required ascertainment of the earliest and latest birth- 
dates of the diagnosed cases. The time between the earliest and 
latest birthdates is hereafter referred to as the "birth period." The 
number of total births occurring within the birth period (i.e., N 
x length of birth period) was divided into the number of observed 
diagnoses. The LT method is more intricate than the Dx and 
DOB methods and is described in detail elsewhere. 32 In addition 
to the information required by the Dx and DOB methods, the 
LT method requires the number of total deaths (due to all causes) 
per year and age at death for each member of the entire study 
population. The LT method uses this all-cause mortality data to 
estimate the number of censored cases. 

Percent bias was calculated for each of the three estimation 
methods. Percent bias gives the percent over- or underestimation 
of a method as a percent of the true parameter value. Regression 
analysis was used to assess the associations between censoring, 
parameter settings and bias. The average percent bias of each 
method across all parameter settings was used to draw conclu- 
sions about the accuracy of each method. 

MC results. The average percent bias scores for each method 
are listed in Table 4, and the results of the regression analy- 
ses can be found in Table 5. On average, the DOB method 
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Table 4. Results of the Monte Carlo (MC) Study: Accuracy of three esti- 
mation methods 



% Bias 


DOB 


Dx 


LT 


IstQuartile 


-42.50 


-0.89 


-0.49 


Median 


-24.60 


-0.50 


-0.12 


Mean 


-24.90 


-0.51 


-0.13 


3rd Quartile 


-7.73 


-0.12 


0.24 



Percent bias describes the percent over- or underestimation of the true 
value, in this case the true lifetime risk at birth of Krabbe disease. Since 
the accuracy of an estimation method is influenced by factors that 
vary from study to study, many studies were simulated. How well each 
estimation method performed across all 2,1 60,000 simulated studies is 
described here. On average, the DOB method was significantly more 
inaccurate than either the Dx or LT method; and the Dx method was 
significantly more inaccurate than the LT method (p < 0.01 for all three 
comparisons, using paired t-tests and the Sidak correction for multiple 
comparisons). 

underestimates true lifetime risk at birth by 24.9%, although 
this fluctuates dramatically based on the parameter settings used 
(significant associations between percent bias and each param- 
eter setting, all p < 0.001). The Dx method underestimates 
by an average of 0.5%, with a significant association between 
censoring and percent bias (p < 0.001). The LT method is the 
most accurate, with an average underestimation of 0.1%, with 
a significant association between censoring and percent bias 
(p < 0.05) and between expected age at diagnosis and percent 
bias (p < 0.001). 

Discussion and Recommendations for Future 
Research 

Epidemiological measures are an important source of informa- 
tion for monitoring the morbidity and mortality of diseases in a 
population. They are also critical outcome measures for public 
health interventions. See the report of Macarov et al. 27 for an 
example in which epidemiological measures are used to docu- 
ment the success of a screening program for Krabbe disease 
implemented in a community in Israel. In the current review, 
we describe various problems that limit the utility of many other 
previously published epidemiological estimates of Krabbe disease 
rates in larger populations. 

As described above, extant studies suffer from inappropri- 
ate terminology applied to epidemiological measures of Krabbe 
disease; in the current review, we show that nearly all previous 
estimates can be correctly interpreted as lifetime risk at birth. 
Previous studies often employ flawed methods to estimate life- 
time risk at birth. In the current review, the DOB method was 
shown to be substantially inaccurate, while the Dx method and 
the LT method suffered from minimal error that was primarily 



attributable to censoring. The LT method was more accurate 
than the Dx method, as it is able to correct for most of the error 
introduced by censored cases. However, the difference in percent 
bias between the Dx and LT methods was small (less than one 
percentage point) . 

Due to the methodological issues discussed above, and given 
the Monte Carlo study results above, we recommend use of the 
LT method, when possible, to estimate the lifetime risk of Krabbe 
disease. In many cases, however, since the data required by the 
LT method is difficult to obtain in studies of Krabbe disease, the 
Dx method is an acceptable substitute as long as the population 
studied has a constant birth rate and diagnosis rate. The DOB 
method is inaccurate and should not be used. Aside from being 
the most accurate method, the LT method is superior for at least 
four other reasons: (1) it can yield estimates of remaining life- 
time risk at ages other than birth; (2) it can be used to calculate 
estimates of standard error; (3) unlike the Dx method, the LT 
method is accurate whether or not the number of births per year 
remains constant; and (4) it is less affected by censoring. Since 
the association between censoring and bias is greater when the 
disease of interest arises later in life, the LT method is the only 
reasonable choice if the disease has a later onset. 

As mentioned above, the between-study differences in case 
ascertainment, statistical methodology, and diagnostic capa- 
bilities are large enough that comparison of rates across studies 
is highly questionable. Future studies should pay special atten- 
tion to case ascertainment, especially for older individuals with 
Krabbe disease who are more likely to be missed. If possible, 
unborn cases that have been diagnosed prenatally should be 
included in the estimated rate. 

Age-specific mortality rates have been published for the 
United States, 38 but have not been published for any other coun- 
try. Although not treated at length in the current paper, mortality 
rates provide useful information and should be similarly esti- 
mated and published for countries outside of the United States. 

The results and conclusions of the current study are directly 
applicable to many disorders besides Krabbe disease. The DOB 
and Dx methods reviewed above have been applied in previous 
studies to various leukodystrophies, 17 sphingolipidoses, 21 lyso- 
somal storage disorders, 7,18 " 20 and inborn errors of metabolism, 22 ' 23 
and all of the recommendations in the current paper are relevant 
to these disease categories as well. 
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Table 5. Regression results for bias analysis 





DOB 




Dx 




LT 








Estimate 


p 


Estimate 


p 


Estimate 


p 




Intercept 


-1.06e+02*** 


<2e-16 


-3.08e-02 


0.59 


-1.11e-01 


0.05 




Study/Birth Ratio 


9.77e+01*** 


<2e-16 












Study Period Length 






-6.89e-04 


0.60 


-6.59e-04 


0.62 




Total Births per Year 


4.85e-03*** 


0.00058 


-4.20e-04 


0.29 


-3.89e-04 


0.33 




True Lifetime Risk 


2.66e-01*** 


6.2e-11 


2.06e-02 


0.069 


2.09e-02 


0.067 




Expected Age at Diagnosis 


7.22e-01*** 


< 2e-16 


3.76e-03 


0.79 


5.93e-02*** 


0.000040 




Percent Censored 


2.02e+03*** 


< 2e-16 


-4.73e+02*** 


5.2e-12 


-1.53e+02* 


0.026 






R 2 


99.03% 




5.17% 




0.79% 










* p < 0.05 


** p < 0.01 


***p < 0.001 










Percent bias scores were analyzed 


separately for each estimation method (DOB, Dx, LT). 
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